Main product detection with graph networks for fashion
نویسندگان
چکیده
Abstract Computer vision has established a foothold in the online fashion retail industry. Main product detection is crucial step of vision-based feed parsing pipelines, focused on identifying bounding boxes that contain being sold gallery images page. The current state-of-the-art approach does not leverage relations between regions image, and treats same independently, therefore fully exploiting visual contextual information. In this paper, we propose model incorporates Graph Convolutional Networks (GCN) jointly represent all detected as nodes. We show proposed method better than state-of-the-art, especially, when consider scenario where title-input missing at inference time for cross-dataset evaluation, our outperforms previous approaches by large margin.
منابع مشابه
Convolutional Neural Networks for Fashion Classification and Object Detection
Fashion classification encompasses the identification of clothing items in an image. The field has applications in social media, e-commerce, and criminal law. In our work, we focus on four tasks within the fashion classification umbrella: (1) multiclass classification of clothing type; (2) clothing attribute classification; (3) clothing retrieval of nearest neighbors; and (4) clothing object de...
متن کاملislanding detection methods for microgrids
امروزه استفاده از منابع انرژی پراکنده کاربرد وسیعی یافته است . اگر چه این منابع بسیاری از مشکلات شبکه را حل می کنند اما زیاد شدن آنها مسائل فراوانی برای سیستم قدرت به همراه دارد . استفاده از میکروشبکه راه حلی است که علاوه بر استفاده از مزایای منابع انرژی پراکنده برخی از مشکلات ایجاد شده توسط آنها را نیز منتفی می کند . همچنین میکروشبکه ها کیفیت برق و قابلیت اطمینان تامین انرژی مشترکان را افزایش ...
15 صفحه اولMulti-Modal Fashion Product Retrieval
Finding a product in the fashion world can be a daunting task. Everyday, e-commerce sites are updating with thousands of images and their associated metadata (textual information), deepening the problem. In this paper, we leverage both the images and textual metadata and propose a joint multi-modal embedding that maps both the text and images into a common latent space. Distances in the latent ...
متن کاملGraph Convolutional Networks with Argument-Aware Pooling for Event Detection
The current neural network models for event detection have only considered the sequential representation of sentences. Syntactic representations have not been explored in this area although they provide an effective mechanism to directly link words to their informative context for event detection in the sentences. In this work, we investigate a convolutional neural network based on dependency t...
متن کاملRandom Dot Product Graph Models for Social Networks
Inspired by the recent interest in combining geometry with random graph models, we explore in this paper two generalizations of the random dot product graph model proposed by Kraetzl, Nickel and Scheinerman, and Tucker [1, 2]. In particular we consider the properties of clustering, diameter and degree distribution with respect to these models. Additionally we explore the conductance of these mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Multimedia Tools and Applications
سال: 2022
ISSN: ['1380-7501', '1573-7721']
DOI: https://doi.org/10.1007/s11042-022-13572-x